Online learning on a continuum

نویسنده

  • Walid Krichene
چکیده

We study a sequential decision problem on a subset S ⊂ R. A decision maker chooses, on iteration t, a probability distribution π over S, then discovers a bounded loss function ` : S → [0,M ], and incurs the expectation Es∼π(t) `(s). The cumulative regret of the decision maker is then ∑T t=1 Es∼π(t) [`(x)] − infs∈S ∑t τ=1 ` (s). We investigate conditions under which one can guarantee a sublinear regret. Previous studies consider the case where S is convex: if the losses are convex, then a simple gradient descent algorithm guarantees a O( √ t) regret, and if the losses are exp-concave, a generalized Hedge algorithm guarantees a O(log t) regret. Building on this previous work, we relax the convexity assumption on S, and propose a generalized Hedge algorithm with a O( √ t log t) bound on the regret when the losses are Lipschitz (uniformly in time) and S is uniformly fat (a weaker condition than convexity). We compare our method to working with a finite cover of the set. In particular, we show that both guarantee a O( √ t log t) bound on the regret, but our method does not need to explicitly compute a cover.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Online Learning Tools on L2 Reading Comprehension and Vocabulary Learning

The aim of this study was to investigate the effects of various online techniques (word reference, media, and vocabulary games) on reading comprehension as well as vocabulary comprehension and production. For this purpose, 60 language learners were selected and divided into three groups, and each group was randomly assigned to one of the treatment conditions. In the first session of tre...

متن کامل

Online-learning: exploring practices among Foundation doctors

Introduction: Postgraduate medical education involves the use ofonline-learning tools. However, there is a paucity of data on theuse of online-learning among doctors who are in their 1st and 2ndyears of professional work after graduating from medical school(also known as Foundation doctors). Our aim was to explore theuse of online-learning among Foundation doctors.Methods: A cross-sectional stu...

متن کامل

Lifelong learning along the education and career continuum: metaanalysis of studies in health professions

Introduction: Lifelong learning is an integral part of healthprofessionals’ maintenance of competence. Several studies haveexamined the orientation toward lifelong learning at variousstages of the education and career continuum; however, none haslooked at changes throughout training and practice. The objectiveof the present study was to determine if there are differencesbetween groups defined b...

متن کامل

Correlation between Online Learner Readiness with Psychological Distress related to e-Learning among Nursing and Midwifery Students during COVID-19 pandemic

Introduction: With the sudden shift of face-to-face education to e-learning during the COVID-19 pandemic, awareness of learnerschr('39') readiness for online learning and its impact on studentschr('39') psychological distress related to e-learning is important for teachers, counselors, and educational planners. Therefore, the present study was conducted to investigate the correlation between on...

متن کامل

Investigating university students' views on online learning

Online learning is a concept that has received attention due to new technologies in the field of education; But today, due to the sudden spread of the corona virus, online learning has become common, so that most of the higher education institutions organize online learning courses. However, for many students, especially new undergraduate students who are used to the traditional learning enviro...

متن کامل

A New Fuzzy Stabilizer Based on Online Learning Algorithm for Damping of Low-Frequency Oscillations

A multi objective Honey Bee Mating Optimization (HBMO) designed by online learning mechanism is proposed in this paper to optimize the double Fuzzy-Lead-Lag (FLL) stabilizer parameters in order to improve low-frequency oscillations in a multi machine power system. The proposed double FLL stabilizer consists of a low pass filter and two fuzzy logic controllers whose parameters can be set by the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014